Comparison of Support Vector Machine Performance with Oversampling and Outlier Handling in Diabetic Disease Detection Classification

نویسندگان

چکیده

Diabetes mellitus is a disease that attacks chronic metabolism, characterized by the body’s inability to process carbohydrates, fats so glucose levels are high. sixth cause of death in world. Classifying data about diabetes makes it easier predict disease. As technology develops, can be detected using machine learning methods. The method done support vector machine. advantage SVM very effective completing classification, quickly separate each positive and negative point. This study aimed obtain best classification model based on accuracy, sensitivity, precision values detecting adding Synthetic Minority Over-Sampling Technique (SMOTE) handling outliers. SMOTE was applied handle class imbalance. Support Vector Machine (SVM) produce function as dividing line or what called hyperplane matches all input with smallest possible error. studied were indications diabetes, consisting 8-factor variables 1 variable. test results show SVM-SMOTE scenario produces accuracy. produced an accuracy value RBF kernel 88% error 12%, this obtained from division training 90:10. 0.880 sensitivity 0.880. research showed factor more accurate if carried out imbalance (SMOTE), concluded distribution influences scenario.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Outlier Detection for Support Vector Machine using Minimum Covariance Determinant Estimator

The purpose of this paper is to identify the effective points on the performance of one of the important algorithm of data mining namely support vector machine. The final classification decision has been made based on the small portion of data called support vectors. So, existence of the atypical observations in the aforementioned points, will result in deviation from the correct decision. Thus...

متن کامل

Comparison of classic regression methods with neural network and support vector machine in classifying groundwater resources

In the present era, classification of data is one of the most important issues in various sciences in order to detect and predict events. In statistics, the traditional view of these classifications will be based on classic methods and statistical models such as logistic regression. In the present era, known as the era of explosion of information, in most cases, we are faced with data that c...

متن کامل

Patient classification as an outlier detection problem: An application of the One-Class Support Vector Machine

Pattern recognition approaches, such as the Support Vector Machine (SVM), have been successfully used to classify groups of individuals based on their patterns of brain activity or structure. However these approaches focus on finding group differences and are not applicable to situations where one is interested in accessing deviations from a specific class or population. In the present work we ...

متن کامل

Common Spatial Patterns Feature Extraction and Support Vector Machine Classification for Motor Imagery with the SecondBrain

Recently, a large set of electroencephalography (EEG) data is being generated by several high-quality labs worldwide and is free to be used by all researchers in the world. On the other hand, many neuroscience researchers need these data to study different neural disorders for better diagnosis and evaluating the treatment. However, some format adaptation and pre-processing are necessary before ...

متن کامل

Acoustic detection of apple mealiness based on support vector machine

Mealiness degrades the quality of apples and plays an important role in fruit market. Therefore, the use of reliable and rapid sensing techniques for nondestructive measurement and sorting of fruits is necessary. In this study, the potential of acoustic signals of rolling apples on an inclined plate as a new technique for nondestructive detection of Red Delicious apple mealiness was investigate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Matrik: jurnal manajemen, teknik informatika, dan rekayasa komputer

سال: 2023

ISSN: ['2476-9843']

DOI: https://doi.org/10.30812/matrik.v22i3.2979